Workload-aware Trie Indexes for XML

Authors

  • Sofía Brenes
  • Yuqing Wu
  • Hyungdae Yi
Abstract

Well-designed indexes can dramatically improve query performance. In the context of XML, structural indexes have proven particularly effective in supporting efficient XPath queries, the core of all XML queries, by capturing the structural correlation between data components in an XML document. The duality of space and performance is an inevitable trade-off at the core of index design. It has been established that the query workload can be leveraged to balance this trade-off and maximize the throughput of a group of queries. In this paper, we propose a family of novel workload-aware indexes that take advantage of the recently proposed Trie indexes for XML. In particular, we propose the WP[k]-Trie, the AWP[k]-Trie, and the W[k]-Trie indexes, which use the P[k]-Trie framework to index frequent label-paths together with a carefully selected complementary set of label-paths. When a WP[k]-Trie index is available, all frequent path queries are guaranteed to be evaluated in one index lookup, and all core XPath queries are guaranteed to be evaluated with index-only plans. With further consideration of the representativeness of label-paths in the index and with proper annotations, the AWP[k]-Trie and W[k]-Trie indexes improve query evaluation performance by efficiently singling out queries with empty results and by enabling more efficient query decompositions, with the W[k]-Trie minimizing the space requirements of the index.
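The general idea of indexing short label-paths plus the frequent paths of a workload in a trie can be illustrated with a toy sketch. This is not the paper's WP[k]-Trie construction, which the abstract does not spell out. It assumes a document given as a hypothetical mapping from node id to (label, parent id), indexes every label-path of length at most k that ends at a node, and additionally indexes workload paths longer than k as whole entries so that each of them resolves in a single trie descent. The names `TrieNode`, `insert`, `lookup`, `root_path`, and `build_index` are all illustrative.

```python
class TrieNode:
    def __init__(self):
        self.children = {}   # label -> TrieNode
        self.nodes = set()   # ids of document nodes whose root path ends with this label-path


def insert(root, path, node_id):
    """Walk or create the trie branch for `path` and record `node_id` at its end."""
    cur = root
    for label in path:
        cur = cur.children.setdefault(label, TrieNode())
    cur.nodes.add(node_id)


def lookup(root, path):
    """One trie descent; returns an empty set if the path is not in the trie."""
    cur = root
    for label in path:
        cur = cur.children.get(label)
        if cur is None:
            return set()
    return cur.nodes


def root_path(doc, node_id):
    """Labels from the document root down to `node_id`."""
    labels = []
    while node_id is not None:
        label, parent = doc[node_id]
        labels.append(label)
        node_id = parent
    return tuple(reversed(labels))


def build_index(doc, k, frequent_paths):
    """Toy workload-aware index (assumed design, not the paper's algorithm)."""
    root = TrieNode()
    for node_id in doc:
        full = root_path(doc, node_id)
        # Every label-path of length <= k that ends at this node.
        for j in range(1, min(k, len(full)) + 1):
            insert(root, full[-j:], node_id)
        # Workload paths longer than k are indexed whole.
        for fp in frequent_paths:
            if len(fp) > k and full[-len(fp):] == fp:
                insert(root, fp, node_id)
    return root


# Hypothetical document: node id -> (label, parent id)
doc = {
    1: ("library", None),
    2: ("book", 1),
    3: ("title", 2),
    4: ("book", 1),
    5: ("author", 4),
    6: ("name", 5),
}
index = build_index(doc, k=2, frequent_paths=[("book", "author", "name")])

print(lookup(index, ("book", "title")))            # {3}
print(lookup(index, ("book", "author", "name")))   # {6}
print(lookup(index, ("library", "book", "title"))) # set(): longer than k and not frequent
```

In this sketch a path that is longer than k and absent from the workload is simply not indexed, so an empty result for it is not meaningful. A real index would answer such a query by decomposing it into indexed pieces, which the sketch does not attempt.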

Similar resources

Trie Indexes for Efficient XML Query Evaluation

As the number of applications that rely on XML data increases, so does the need for performing efficient XML query evaluation. A critical part of the solution involves providing new techniques for designing XML indexes and lookup algorithms. In this paper, we leverage the results of our research on coupling the partitions induced by fragments of XPath algebra and those induced by the structural...

Routing XQuery in A P2P Network Using Adaptable Trie-Indexes

The increasing popularity of XML and P2P networks has generated much interest in distributed processing of XML data. In this paper, we propose a novel solution for schema and content indexing in a P2P architecture. Our solution is based on a hierarchy of super-peers managing progressively compressed summaries of XML schemas. The main features of the extended tries used for schema and XML data i...

Implementing block-stored prefix trees in XML-DBMS

Efficient search through large amounts of text data is a well-known problem in computer science. We introduce a BST data structure that allows searches through a set of string values and is optimized for reading and writing large blocks of data. This paper describes the algorithms for insertion, deletion and search of variable-length strings in disk-resident trie stru...

On the Index Selection Problem applied to Key oriented XML Indexes

In the world of Relational Database Management Systems (RDBMS), indexes are used to accelerate specific queries. The selection of indexes is an important task in database tuning, performed either by a database administrator or by an index selection tool that suggests a set of suitable indexes. In this paper we transfer the concept of specific indexes to XML Database Management Systems (XDBMS) and...

A Selective Key-Oriented XML Index for the Index Selection Problem in XDBMS

In relational database management systems, indexes are used to accelerate specific queries. The selection of indexes is an important task when tuning a database, performed either by a database administrator or by an index propagation tool that suggests a set of suitable indexes. In this paper we introduce a new index approach, called key-oriented XML index (KeyX), that uses specific XML element or...

Publication date: 2009